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I » INTRODUCTION 


This paper is concerned with the computer implemen- 
tation cf both continuous and discrete gradient methods 
for adjusting the parameters of a dynamic system so as to 
match a specified response function as closeTy as possi- 
ble* While the basic theory of parameter optimization by 
gradient descent has been known for some time, the lim- 
itations and convergence properties of particular methods 
of computer implementation are not yet well understood. 
This paper is intended to^bc a contribution toward obtain- 
ing a better understanding of these problems* 

Continuous' parameter optimization is an appealing 
concept and a number of "adaptive control" schemes have ^ 

been based on it* The first part of this paper reviews ^ 

the- formulation of a continuous steepest descent algorithm 
and discusses its difficulties* Computer results relating 
to the nature of the gradient and the dependence of the 1 
path in parameter space on adjustment gain are given* [ 

The second part of the paper reviews briefly sev- 
eral discrete gradient optimization techniques* An algo- l 
rithm for automatic adjustment of -step size for gradient j 
descent is presented* The stability and convergence prop- 
erties of first and second order iteration schemes are 
compared and some new results are presented in the form 
of a convergence theorem* The application of discrete 
parameter optimization methods to a nonlinear dynamic 
system i s - i llust rated with an example. 
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The final section of the paper presents a formula— 
tion of a hybrid computational strategy for parameter 
optimization which includes the best features of both the 
analog and digital computer solutions. 

2. CONTINUOUS PARAMETER OPTIMIZATION 


We consider continuous dynamic systems described by 


y » F(y, t; p) 


( 1 ) 


where y is an n-vector representing the state of the 
system and p is an m-vector representing the parameters 
to be optimized, including initial conditions. The param- 
eter optimization problem under consideration is that of 
selecting p in such a way that the solution of Eq. (l) 
approximates a given function, yd(t), as closely as pos- 
sible. The particular criterion function to be used as 
a basis for parameter adjustment in this paper is given* 
by 

T 

0(p) = J Cy(t;p) - y d (t)] 2 dt ( 2 ) 

° o 

Gradient methods of optimization are based upon 
adjustment of parameters- utilizing the local gradient 
vector. That is, a parameter change vector, &p, is com- 
puted according to the rule 


Ap = - K 70(p) 


(3) 


where K is a positive definite matrix and 70 is the 
column vector 



Following the i-th such calculation 
parameter vector is given by . 





, the value of the 


(4) 


-Ci+1) 

P 




(5) 


The convergence properties of several iteration schemes 
of this type are discussed in Section 3 of this paper. 

Consider noyr the case where continuous, parameter 
adjustment is desired. It- is clear that the criterion 
function defined by Eq. ( 2 ) cannot be used directly since 
it leads to an iterative adjustment algorithm* ' Let us. 
therefore define an instantaneous performance cr* +****’ ^« 
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f c lit ° Cy(t;p(t))- ya(0] 2 (6) 

Unfortunately^ f is a functional in p(t) rather than an 
ordinary function. Consequently , the gradient vector, 
does not exist unless p(t) is a constant. But 
this contradicts the original objective of the formula- 
tion, namely, to adjust p continuously as a function of 
time. Two different approaches to the resolution of this 
dilemma have been taken. If the desired output is a vector 
of derivatives, y^Ct), with dimension equal to the order 
of the system to be optimized, then Eq. (l) may be used 
to derive a criterion function which is a simple function 
of p even when p varies with time. Specifically, if 

f Q (p) ~ F 2 (y d ,t;p) (7) 

then the gradient 

*.<*> ■ 2F H < 8 > 

exists and may be used to find a minimizing value for p 
by making use of the adjustment algorithm 

P =* - k Tf ^ ( p) {9) 

This method, sometimes called the "equation error method" 
has been used by Graupe U) , Ornstein ( 2 ) , and others in 
connection with identification problems. 

While the equation error method avoids the diffi- 
culty associated with Eq. (6), computer implementation of 
the method requires chat desired values for all of the 
system state variables be available. An alternate for- 
mulation, based on the work of Whitaker ( 3 ) and Margolis 
(Jt) does not require complete /specification of the desir- 
ed state, out leads only to an approximate gradient meth- 
od. The degree of approximation is related to the rate 
of change of adjustment of the parameters as compared to 
the natural frequencies of both uio system and the input 
process. The remainder of this paper is restricted to 
the latter iormuiation; i*e v , to circumstances where y ^ 
is a scalar function. While the basic technique to be 
described for continuous parameter adjustment is not new, 
the results pertaining to thq dynamic properties of the 
parameter adjustment process have not been previously 
published. 

2.1 The Approximate Gradient Method. 

The performance criterion Eq. (7) requires complete 
knowledge of the desired state 0 Let us consider instead 
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the criterion 

f i 

c 

where p < n , 


(e ♦ q_ e + •• ♦ q , 
C I 1 c 2 P*"i 


e ) ' 
cp* 


(id) 


is the order of the system, arid 


e . 
ci 


di 


d 

dt 


( i-1 ) ( i-1 ) 

y yd 

rTTT dt Ci— iT 


(11> 


When y (t) is given as a scalar function, error deriv- 
atives must be obtained by analog computer differentia- 
tion* In many practical situations it is possible to 
choose all the q^ « 0 , so that only the system output 
(or zero-state) is required. The quantity e c represents 
“output error 11 and parameter optimization based on Eq* 
(lO)may be called the “output error method 11 * 

Let us choose 


\ e 




e * y 
c 2 


( 12 ) 


Then, if the parameters are constant, the components of 
the gradient are given by 


ar 

"5- — • « 2(e * * q 

dp. Ci . 

Using the definition of e c x 
since yd is independent of 
be written as 


e ) (e + q e ) (13) 

c 2 C 1 c 2 

i = 1 , 2 , • e • m 

and e C 2 from Eq. (ll) and 
the parameters, Eq* ( 13 ) can 




+ q 


e ) 
c 2 




i 


+ q y^) 1 ; 
1,2, •• 


( 14 ) 


m 


where ana represent the system output an ( d its 

first derivative respe ctively * Let .us denote the influ- 
ence coefficients by the letter u so that 


u . . 



( 15 ) 


The influence coefficients can be obtained by differen- 
tiation of the system Eq* (l ) with respect to the appro- 
late parameters and solving the resulting differential 
equation in u^j (the f, sensi t ^;vi ty equation”) (£) . Analog 
computer circuits can be used for the simultaneous eval- 
uation of the and the » 

Now, if the parameters are adjusted, becomes a 
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functional and the dy^/Spj do not exist in the ordinary 
sense. Let us assume, however, that in a given solution 
interval, the variation is sufficiently slow so that Pj 
can be assumed constant* Then, a continuous gradient . 
method is based on 


» k 


3 *c ( P> 

dp i 


( 16 ) 


However, analog computer circuits based on Eq* (16) are in 
fact mechanizations of 


£>. = - k g (e , y, p) (17) 

1 AC 

_ * i 

where the vector G « Coi* 02? On! is an approxima- 

tion to ^f c which approaches ^f c as k ^ O * The 
u ij which enter into the calculation of the gf can ke 
considered subsidiary var i ables t whi ch equal the desired 
sensitivity coefficients when p - O • 

An analog computer implementation of the approximate 
gradient method (the output error method) is sho\<m in Fig# 
1 for q « O in Eq* { 1 2 ) • This figure illustrates the 
application of the method to an identification problem* 

As long as the switch S is open, the parameters are 
constant and 5 7f c * Consequently, the nature of the 
gradient can be studied in the open-loop case* Then , the 

switch can b§ Glased for examination of tha aotuai paras* 
eter adjustment path* 


2*2 The Nature of the Criterion Surface 


Let 


f 

c 


cl 




(18) 


and define the parameter offsets 6p^ by 

6 p . ° p ) ° ^ i - (19) 

i ^ 1 

whore pf°^ represents the assumed initial values of xJne 
parameters and p) ' the values which minimize f c 
Then, if the loop is open, we can expand f c as follows: 

m 5e 2 

f o (t) - [ % {5 <0) > - 6 Pi . 0(6P 2 )] <20> 

1=1 


If the 6p. are sufficiently small , second and higher 
order terms' may be neglected and 
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m 


f (t) 
c 


* I-Xi -J 


( 21 ) 


i«l 

Consequently, contours of constant 


f c (t) « C 


time 


can be found from 
m 


I 

i =1 


U (t . ) 6p . a 
1 J- J 1 


c 2 - 


,-(o) 
e (p 
c 


at any 


t.) ( 22 ) 

j 


This equation represents two parallel lines in the mrl 
dimensional space of the parameters and criterion func- 
tion* If only 2 parameters p^ and ]>£ arc present , it 
is possible to obtain a simple geometrical interpretation 
of this equation as shown in Fig „ 2, It can be seen that 
the instantaneous criterion function surface is a parabol- 
ic trough, where the initial and final parameter values 
arc indicated. The contour lines in the - P2 plane 

are straight lines, while the intersection of the trough 
with the f c - p x planes results in the familiar quad- 
ratic shape. It should be noted that Fig. 2 represents 
an instantaneous situation. As the u lj and e c change 
with time, the trough moves in such a way that its min- 
imum still crosses the desired final point (6). 


2*3 The Gradient Vector 


Considerable insight into the nature of the adjust- 
ment process is gained if the gradient (with S open, of 
course) is evaluated with a sinusoidal input. The gra- 

1 

dient is given by 


% * 2 L% ’’ll' °c U i2> 


e u I <23 ) 

c lit J 


Consider, for example, a desired response function 
obtained from a second order sy stem described by the 
relation 


A ,y 
6 J d 


V 


V 0} a y do 


( 2h ) 


where 


r° 

B , = . 

d L“3 


o - 
a 4- 


x 1 


L x j 


a r c 1 " 

d l“ a 2 _a I J 

and the coefficients a, , are constants. The 

signal x(t) is the input to the process. It is desired 
to optimize the parameters to of a model de- 

scribed by . 

ynAy + Bx , y(o) « y 


( 25 ) 
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where 


r u a 1 

r° 

0 1 

L_o, 2 -O-i j , 

= L°'3 

^ j 


using tho criterion function of Eq. (l8). The locus of 
the gradient vector can be plotted using the computer in 
the parameter plane defined by any two of the parameters* 
To further simplify tho visualization of the results, let 
ydo G 7o and - A 50 that the differences between 

Y d and y are due entirely to differences between 
and B. To compute the gradient (as defined by Eq. (23) 
the sensitivity coefficients <= dy/do^ and U4 *= 

Sy/dcy^ will be required* These coefficients are obtained 
from computer solution of two subsidiary equations, de- 
rived from dif ferentiation of Eq* (25) with respect to a 3 
and ai± respectively * The sensitivity equations for this 
case are 


_ 

o Au^ T X 

o 

“4 " aG 3 * C 4 * 

where 


(26) 



Suost i tut ion of xhe solution of Eq. (26) into Eq. (23) 
yields the ins tan tancous values of the gradient vector. 
Typical results are shown in Fig. 3, where x(t) is a 
sinusoid with a frequency of 1 rad/sec* Since the adjust- 
ment loop is open- the parameters remain constant, but the 
sensitivity coefficients and the matching error e c vary 
with time, rosul tings.i n the Lissajous-1 ike contours in the 
figure* Since for sinusoidal inputs and linear systems 
both y c j and y are sinusoidal, the error is also 

sinusoidal and becomes zero every half-cycle. From an 
examination of this figure, it is clear that if it is 
attempted to adjust parameters with a velocity propoi — 
tional to the (gradient, the motion will be oscillatory 
and may instantaneously point in an erroneous direction* 


2*4 Paths of Parameter Adjustment 

The dependence of the parameter adjustment path on 
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the parameter k in Eq, (17) is indicated in Fig. 4 for 
the example discussed previously, When k is very small t 
the path closely approximates a gradient trajectory. When, 
k 1 s large, the functions arc not equal to the re- 

spective coefficients of the gradient, but approach a 
gradient path as the 6p^ — 0 . The “scallops,* 1 on the 
trajectories are due to the oscillatory nature of the 
approximate gradient vector. Values of k larger than 
those indicated xn this figure may cause instability in 
the parameter optimization loops, 

2,5 Stability of the Parameter Adjustment Loops 

General analytical results on stability are not 
available at the preseni time, Stability in the small has 
been demonstrated by Margolis (4) for first and second- 
order systems with step inputs. Experiments using analog 
computers show that it is generally possible to find a 
value of k for which stability and convergence of two or 
three parameters is possible. However, attempts to improve 
convergence by increasing k or attempts to adjust more 
than three parameters simultaneously generally result 
either in instability or in lack of convergence (6), 

* At • the present time a general existence theorem 
insuring local stability of the parameter optimization 
technique for sufficiently small gain is lacking. Such a 
theorem would prove that a value of k can be found in 
any particular case such that, for specified classes of 
inputs and initial conditions, both stability and converr 
gence can be assured. 


3, DISCRETE PARAMETER OPTIMIZATION 
3*1 Discrete Gradient Descent 

The convergence problems encountered m continuous 
parameter variation schemes may be largely circumvented 
by making use of a discrete iterative adjustment algorithm. 
When this is. done it becomes possible to determine the 
true gradient of a given criterion function since param- 
eter changes are made only at discrete points in time* 

For example, the criterion function given by-Eq, (2) may 
be dif ferentiated to produce 

_ _ r T r ^ f 1 - 

70(p) n j 2j^y(t;p) - y^ttlj *?y(t;p) dt ( 27 ) 

o 

Since p is a constant over the interval of integration. 
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the partial derivatives of y appearing m this expres- 
sion may be obtained by solving the parameter influence 
differential equations associated with the assumed equa- 
tion for y . 

The gradient vector computed from Eq» (27) way be 
used for* discrete gradient descent employing Eq # ( 3 ). As 
in the continuous case, the convergence of this parameter 
adjustment procedure depends upon the values chosen for 
the elements of K* However, since the gradient used 
here is exact rather than approximate, discrete gradient 
descent may always be stabilized by multiplying every 
element of K by a sufficiently small scale factor. At 
the present time, an analogous statement cannot bo made 
for continuous parameter adjustment procedures utilizing 
approximate gradients* 


3 #2 The Optimum Gradient Method 


Since the criterion function, 0 . is bounded from 
below by the value zero, it follows that in any region 
where 0 is continuous there must exist at least one 
value for a scalar scale factor k say k o k* , such 
that whenever 1 ^ 0 j / 0 , 


min 
k > 0 





M 


0i ; <l) - k- 



( 28 ) 


The “optimum gradient method 11 ( 7 ) utilizes k* in Eq* 
( 3 ) i-e., the matrix K is computed anew at every cycle 
of iteration as 


K = k" I (29) 

This choice for K not only 'guarantees that the sequence 
of values for 0 converges, but also assures that the 
steps taken in parameter space are large enough to make 
the convergence reasonably rapid. 

With the restriction that only a finite .number of 
values for k may be considered, the search for k may 
bo carried out automatically by a digital computer* One 
method for accompli shing this can be based upon an ini- 
tial value for k computed from the ' Nowton-Haphson for- 
mula* If this gain value is called k° , then at the i-th 
stage of iteration 



1 


bW u) 



(30) 


and 
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Lp 


( i ) 


- k° ^ 0 ( p ^ ^ ) o 


j^(p (i) )7'0(p (i) ) ’ 

^cp <i> r^(p U) )‘ 


(31) 

Starting with this gain, a binary scale factor search may 
be conducted by determining an integer n > 0 which at 
least locally minimizes the expression 


0 (p^*\ n) e 0(p^^~ 2 n k? 70 ) (32) 


3*3 A Computational Algorithm for Scale Factor Adjustment 


A program to implement scale factor adjustment usinQ 
the approximate optimum gradient method described above 
has been written and tested (83. Fig. 5 is a flow-chart 
for this program ** 4 This algorithm includes a quadratic 
interpolation formula to permit more accurate determina- 
tion of an optimum scale factor. 

In order to wake efficient use of the binary search 
part of the algorithm illustrated by Fig. 5* it is impor- 
tant that the search begin at a good value for n * Where- 
as the full NcwVon-Raphson step (n=0) obtained from Eq* 
( 31 ) may produce good convergence in the early phases of 
iterative optimization, it has been found in numerical 
experiments that ever larger values of n are needed in 
the terminal stages unless 0 attains the value zero at 
its minimum (8). This comes about because Newton-Raphson 
iteration is based upon linear extrapolation of 0 to 
zero* When T0 approaches zero while 0 remains pos- 
itive, the computed parameter change vector grows without 
bound unless there is a corresponding increase in the 
value of n used in the binary scale factor of Eq. (32). 
Computational experience indicates that this difficulty 
may be resolved by beginning each scale factor search 
with the value for n which was found to be optimum dur- 
ing the previous search cycle ( j8) «, 

3.^ Constrained Minimization 


while application of the optimum gradient method to 
parameter optimization problems docs indeed lead to a 
convergent sequence in 0 v it cannot be assumed that the 
sequence in p also converges. This difficulty can be 


The numbers on the various blocks of this figure 
refer to^FORTRAN statement numbers. In thi s _ diagram, the 
symbol c rather than p has been used to represent a 
parameter vector 
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Fig, An Algorithm Tor Determining the Optimum 

Scale Factpr Tor Gradient Descent 
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avoided by specifying a bounded set from which p must be 
chosen* The "gradient projection 11 method devised by J#B. 
Rosen can be applied to closed convex constraint sets to 
obtain constrained minima by gradient searching ( 9 » 1.0 ) • 

A very much simpler procedure has been developed* to permit 
the use of the algorithm given in this paper in. conjunc- 
tion with independent ly constrained parameters (8); i*e. 
with constraints of the form 


P < 
v k - 


k 


(33) 


Application of such constraints often produces better con- 
vergence even in optimisation problems not naturally con- 
strained (8 ) * 


3*5 Second Order Methods 


The convergence of iterative minimization proce- 
dures can be sharpened markedly in certain circumstances 
by making use of second derivative information# In par- 
ticular, "Newton's" method, given by 



r — 

Ldp.dp j j 


-1 _ 

7 0 ( r> 


u>* 

/ 


(34) 


possesses quadratic convergence properties at a regular 
minimum of p ( 1 i ) . 

The matrix of second partials in £q . (3^) can be 

obtained by differentiating each row of Eq . (27) with the 
result 


where 



The matrix S amounts to a regression matrix since it 
links the linear dependence of the response function* 
y(tjp), to the integral squared error function, $ , 
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To avoid the computational difficulties associated 
with the determination of second derivatives t it has some- 
times been suggested that Newton iteration be modified to 
include only the S matrix in Eq. ( 36 ) ( 1 2 ) » If this is 

done, the basic iteration equation becomes 

*T P = - ~ s" 1 70(p) (40) 

This scheme has been called "Gauss-Nowton” iteration ( 13 ) » 
Depending upon the magnitude of D relative to S , it 
may not converge. More precisely, it is shown in the 
appendix to this paper that if 0 possesses a regular 
minimum at p = p G y then if S Q and D 0 denote the val- 
ues for S and D at p D „ a region of convergence for 
Gauss-Newton iteration exists about p Q if and only if 
all of the eigenvalues of the matrix 

0 =. S" A D (41 ) 

c c 

are less than one in absolute value. 

The residual error, c , existing at a minimizing 
value for p > P = P c 1 appears as a, muln pi icativc fac- 
tor in the expression for D „ Consequently, the eigen- 
values of Q will tend to be small when this error is 
small and large when it is large. Vhett y^ft) represents 
a function which can be matched exactly by a solution of 
the assumed system equation, Eq. (l), then at p = p 0 1 0 
is identically zero and all of the eigenvalues of 0 are 
likewise equal to zero. Gauss— Newton iteration reduces 
to Newton iteration in this circumstance and quadratic 
convergence is obtained utilizing only the first deriv- 
ative information contained in 3 

3-6 An Example of Discrete Parameter Optimization 

The quadratic convergence predicted for Gauss-Newton 
iteration has been observed in numerical experiments (_ 8 ) • 
These experiments involved optimization of the four dimen- 
sional parameter vector associated with the nonlinear dif- 
ferential equation 


c y * c y f sm y = 0 
2 1 

y(0) - y ( 0 ) = (42) 

The dusired response, y d (t) , was obtained from nuiaor-^ 
icai solution of this equation with a given value for c* 
An incorrect value was then taken as a starting point and 
the computer was permitted to adjust this value itera- 
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tivoly using both the optimum gradient and Gauss-Newton 
methods* Tables 1 and 2 summarize the results of this 
experiment* Table 2 shows that quadratic convergence is 
indeed obtainable through Gauss-Newton iteration even 
though only first derivative information is used* 

4* HYBRID COMPUTER IMPLEMENTATION. 

4*1 Division of the Computation Load Between AnaloQ and 
Digital Machines 

In principle, any of the methods which have been 
described could bo implemented on either a digital com- 
puter or an iterative analog computer. When realistic 
equipment limitations arc taken into accounx however, con- 
tinuous parameter adjustment is most naturally carried out 
by analog computation while iterative adjustment seems to 
be best suited to a digital computer* Both of these 
choices suffer from certain drawbacks, however. As has 
been noted, the stability of a continuous parameter ad- 
justment algorithm is very difficult to ascertain a prio- 
ri. Generally, manual intervention is required to achieve 
a loop gain producing reasonably rapid convergence with- 
out instability. On the other hand, when a completely 
digital solution to dynamic system parameter optimization 
problems is attempted, it is quite likely that an exces- 
sive amount of computer time will be required since dig- 
ital machines are not naturally suited to high speed 
iterative solution of differential equations. For most 
optimization algorithms, the best features of both types 
of machines seem to be needed* 

A combination of digital decision ana branching 
capaoili Lies and analog solution speed is available in a 
hybrid computer. With a hybrid computer possessing a 
sufficiently flexible control structure, analog computer 
potentiometers and initial conditions can be adjusted 
automatically under program control so that tnc analog 
machine effectively provides high speed subroutines to the 
digital computer whenever differential equation solution 
is required* Conversely, by monitoring the results of 
continuous parameter adjustment via analog to digital 
converters, the digital machine can assure stability in 
otherwise uncertain circumstances. Finally, an appro- 
priate mixture of discrete and continuous parameter 
optimization algorithms can be used on the sarao problem • 
under overall digital control. One might, for example, 
utilize discrete methods in large error conditions and 
continuous methods for M fine tuning” of parameters. 
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Table 1. 


Parameter Optimization by 
Method 



Optimum Gradient 


i </ 


Iteration 

Number 

c i 

°2 

C 3 

c 4 

0 

0 

.5100 

1*0 10 

1.570 

*01000 

1.853X10" 3 

1 

.5053 

1.005 

1.567 

.00539 

'*. 662 x 10 * 

2 

*5027 

1.003 

1 « 5^6 

.00391 

1 . 185 xl 0 -/l 

3 

.5012 

1 * 002 

1.565 

.00295 

3.109xl0 -5 

True 

Values 

• 5000 

1 * 000 

1.569 

.0000 

0 

Table 

2. Parameter 

Op c i mi za t ion 

by Gauss- 

-Newton 




Iterat ion 



Itera- 

tion 

Number 

c i 

C 2 

s 

C 4 

0 

0 .51000000 

1.0100000 

1.5700000 

,01000000 

1 .853x10'" ^ 

1 .50013945 

.9996625 

2 1.5690003 

.00007233 

4.428xl0” 7 

2 0 50000021 

1 .0000002 

1.5690004 

,00000068 

1.086X10' 13 

True 

Values 






. 50000000 

1.0000000 

1.5690000 

.0000000 

0 
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4*2 An Algorithm for Parameter Optimization by Hybrid 
Computation 

Figure 6 provides an example of an overall' algorithm 
suitable for implementation on a hybrid computer. The 
PEG RES subroutine appearing on this figure determines 
0(c) and 70(c) as well as the regression matrix, S , 
by analog solution of the assumed system equation and the 
associated parameter influence equations. The Gauss- 
Newton parameter change vector, designated 5 * is then 
evaluated by digital inversion of the S matrix followed 
by matrix multiplication as in Eq. (4o)* GRASER is the 
Fortran Symbolic name attached to the scale factor adjust- 
ment routine illustrated by Fig* 5* The numerous evalua- 
tions of " 0 required in the execution of the GRASER sub- 
routine are also intended to be accomplished by analog 
means. However, all of the decisions appearing at branch 
points of both Figs. 5 and 6 are realized by a digital 
prog ram. 

Both the Gauss-Newton and optimum gradient iteration 
techniques are incorporated into tins algorithm. As Fig« 

6 shows,* the routine favors the Gauns-hewton parameter 
change vector, 8 * and switches to the optimum gradient 
method during a given iteration cycle only when the Gauss- 
Newton vector fails to satisfy certain criteria. The 
algorithm provides for independent constraints on the 
values of each parameter; the region R refers to the 
n-dimcnsional box defined by these constraints. 

Since the parameter adjustment procedures used here 
are all iterative, some means for stopping the iteration 
must be provided. Fig* 6 incorporates five different 
stopping rules operating in parallel. The dtf and ^d c 
criteria refer to the percentage change in 0 and c in 
two successive iterations* When either of these variables 
falls below a value specified in advance, computation 
ceases . 

4*3 Experimental Results 

The algorithm proposed was tested using the system 
described by Eq. (42). The desired response function, 
y^Ct) , was ootained by numerical solution of this equa- 
tion with a specified parameter vector. Since a hybrid 
computer was not available, digital subroutines were used 
to simulate the necessary analog computations. 

Table 3 summarizes the results of this experiment* 
The column labeled %t n u denotes the exponent, of the op- 
timum binary scale factor found by gradient searching 
(Eq* (32) )« The entry G-N in this table indicates that 
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Table 3* 


Sequence of Parameter 
Hybrid Computer 


Estimation Produced by the 
Algorithm 


Itera- 

tion 

Number 

C 1 

C 2 

c 3 c 4 

0 

n 

0 

1.0000 

2.0000 

3.0000 1.0000 

360.3 


1 

1.5000 

1.8176 

2.5322 0.5201 

34.12 

0 

‘ 2 

1.5000 

1.6091 

1.6341 -0.2000 

1.462 

0 

3 

1.0907 

1.4029 

1.0000 -0.2000 

O.690 

0 

4 

0.8018 

1.5123 

1*3731 - 0.1412 

O.366 

0 

5 

0.7581 

1.4656 

1.3567 -0.2000 

0.270 

1 

6 

0.5460 

1.2693 

. * 

1*3978 -0.2000 

0.109 

0 

7 

4 

0.5494 

0.9792 

• 1.6642 - 0.1253 

8 . 48 xlO~^ 

G-N 

8 

0.5000 

1.0005 

1.5619 -0.0111 

2 . 49 x 10 “^ 

G-N 

9 . 

0.5001 

1.0000 

1*5691 - 0,0001 

1 . 78 x 10 “® 

G-N 

10 

O.500O 

1.0000 

1.5690 0.0000 

3 .l 6 xlO -14 

G-N 

True 

Value 

0.5000 

1*0000 

1.5690 0.0000 

1 

0 

- 

Upper 

Limit. 

1.5000 

v l>000 

3 . 5000 1 . 5000 

- 


Lower 

Limit 

0.0000 

0.0000 

1.0000 -0.2000 


•a. 
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*:: P r : tion rati - «-» ■*»«-* — hl „ 9 

5. CONCLUSIONS 


co„„ptu" 1 t iv U »iLfe r but t ff algorithms are 

difficult to TZtlZ “V: convergence properties are 
algorithms can be obtained b * ababllit f of optimizing 

peter supervising and octroi "“si" 0 “' 0 di ° ital <=— 
are naturally suited to * h k • Since analog computers 
of different! n l *• th ° hl ° h 8poed iterative solution 

tion seems to offer“onsid “ n “ lo °" di 8 i ‘«1 computa- 

«■"» f- systero^opt imi zation ^ 16 , ”" 1 “ ” « Pra “ iaal 

appendix 

Of 


Theorem r 


■- P o represent a point in parameter space at 
*t£lins minimum i , . _ 


and D, 


which %) -attains 

denote the values of the matrices S and D^at^h" ^ 
£oint. Assume that there exists an * Ho ) ui ?* ^ hXS 
Po auch that 0{ p), f0(g), y(t- ? ) I”! -? ab ° Ut 

possess uniformly convercent^ ? r y(tjp) all 
ther that the matrix Si . T or aeries. Suppose fur- 

same neighborhood. Then, provideTt^r | ve ^ he « in the 
positive definite matrix, when A* D (i) t ° ° 1B & 

Gauss-Newton formula and pM eh iS co °P ut ed by the 

P ls chosen so that 


in (l) - I 
Ip - pI < 


e ♦ 

© 


there exists an 


0 < e < t 

o 


<43 ) 


e such that 


lira 

i “'GO 




if and only if all of the eigenvalues of • • 

) a n ^ , lycnYtuu ea oa the matrix 

o o re leas than one in absolute value, 


(44) 


Proof 2 


Let 


«; (i) 


tho Ho i a if denote the difference 
the desired minimising value for n t 

i-<i) -(i> - 

- P - p 


between 

e 0 


;<*) 


and 


6p 


J£> 


( 45 ) 
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Then from Eq« ( 35 ) 

0(p> - 0i; o ) ♦ ii‘ (s o *d o ) «; . oiiph ( 46 ) 
and therefore 

?0(p) « 2 (S o + D o > 6p + o(6p 2 ) (47) 

Eq! 3 (4o^ 1 proiucL e Jhrreiur t int ° ^ Gauss ” Newt ™ formula 


Ap (i) 


- S(^ (l) ) _i (s +D ) 6 -<i) 
00 


£48) 


^y(t^\ Under the assun, P tions regarding y(t { £). and 


S <P (1/ ) “ v 0(6p (l) ) 


so Eq* (48) reduces to 


* (i) - <- 1 - s ; 1 D o * R ± ) 6 p“> 


(49) 


(50) 


where % is a remainder matrix which tends to zero a« 
6 p approaches zero. Now since * 


45 <i+l) 


,-(i) A - (i) 

$P + Ap 


E q* (50) yields the recursion relation 


fi£ ( - 1} « <- S~ A D o + R t ) 5p (i ' 


(51 ) 


(52) 


mav^be^mArf sufficientl y small, the remainder matrix, 

^ToVrlXl ']£*)] *" ~* aa " to 0 . ' 

one m absolute value. thaK 
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